List of Flash News about machine reasoning
| Time | Details |
|---|---|
|
2026-02-12 21:01 |
Gemini 3 Deep Think Achieves New Records in AI Benchmarks
According to Demis Hassabis, the Gemini 3 Deep Think model has undergone a significant upgrade, achieving groundbreaking results in key AI performance benchmarks. These include an 84.6% score on ARC-AGI-2, 48.4% on Humanity’s Last Exam without tool assistance, and a 3455 Elo rating on Codeforces. These advancements underline the model's capabilities in mathematics, science, and reasoning, providing notable implications for AI-driven innovation and applications. |